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AN A POSTERIORI ANALYSIS OF SOME INCONSISTENT, NONCONFORMING 
GALERKIN METHODS APPROXIMATING ELLIPTIC PROBLEMS 

TRISTAN FRYER 


Abstract. In this work we present an a posteriori analysis for classes of inconsistent, nonconforming 
schemes approximating elliptic problems. We show the estimates coincide with existing ones for interior 
penalty type discontinuous Galerkin approximations of the Laplacian and give new estimates for inconsis¬ 
tent discontinuous Galerkin approximation schemes of elliptic problems under quadrature approximation. 
We also examine the effect of inconsistencies on the a posteriori analysis of schemes applied to an unbalanced 
problem. 


1. Introduction 

The design of efficient numerical approximations of partial differential equations (PDEs) is of paramount 
importance in scientific computing. To this end a large amount of work has been carried out on finite element 
methods and, in particular, on the a posteriori analysis of such methods. The a posteriori framework allows 
for the construction of adaptive methods that, given a fixed number of degrees of freedom, are able to better 
approximate nonsmooth solutions of PDEs than the nonadaptive counterparts |AO001 IVer96| . 

Discontinuous Galerkin (dG) methods form a class of finite element approximation schemes. These, as 
the name suggest, are constructed without enforcing continuity of the discrete solution over the domain. 
For second order elliptic problems these are nonconforming methods. See |ABCM0^ for an accessible 
overview and history of these methods for second order problems. For higher order problems, for example 
the (nonlinear) biharmonic problem, dG methods are a useful alternative to using conforming elements 
whose derivation and implementation can become very complicated [Bak77[ IGHVlll |Pryl4| . The historical 
origins of the method can be traced back to that of the transport equation and, as such, the dG methods in 
the non-selfadjoint case, for example convection dominated problems, are extremely powerful |Coc99) . 

In the elliptic case there are two main approaches to the a posteriori analysis of such nonconforming 
schemes. The first is based on a Helmholtz decomposition of the error IBHLO.Ij and the second based on 
reconstruction operators [KP03) . The reconstruction approach is extremely versatile and has been applied 
not just to elliptic problems but also to those of hyperbolic type [GHM141 IGMP15j . In both scenarios the 
main idea is to develop an appropriate reconstruction of the dG object which is itself smooth enough to 
apply stability arguments of the underlying PDF. The blanket assumption in the elliptic case is that, up 
to mesh dependent terms, the dG approximation is “close enough” to a conforming approximation which is 
indeed true for primal methods chosen with large enough penalty parameters [DPE12] . 

In this work we utilise these reconstruction operators to develop an abstract a posteriori framework for 
use in elliptic problems. We are particularly interested in the effect of inconsistencies in the estimate. The 
inconsistencies that we on focus may arise in various forms, quadrature approximation being the most obvious 
example. This has been studied previously in the form of data oscillation in the convergence of /i-adaptive 
algorithms [MNSOOl c.f.]. Another related work is [NSSVOh] where the authors study quadrature effects of 
which arise from a nonlinear source term. Inconsistencies may come from other sources though, for example, 
stabilisation introduced in the approximation of convection dominated problems |Bur05j may be viewed in 
this fashion. Mixed methods like an operator splitting of a nonconstant coefficient fourth order problem or 
discretisation of a nonvariational problem |LP111 IDP13) are inherently inconsistent as are those of virtual 
element type [BdVBG~*~1.3| . With this range of inconsistencies in mind we build our a posteriori estimate to 
resemble Strang’s Lemma, the starting point when conducting a priori error analysis of inconsistent methods. 
We show that as long as a reconstruction operator is available, inconsistencies such as those mentioned above 
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are a posteriori controllable. Interestingly, depending on the class of inconsistency, it is not always necessary 
to a posteriori compute the reconstruction, in some cases it suffices to only have existence of such an operator. 

By building the a posteriori bounds in this way and using ingredients from standard a posteriori analysis 
we can deal with problems as diverse as unbalanced problems like linear nonvariational and nonlinear p- 
Laplacian or p-biharmonic type, although in this introductory work we will restrict our attention to linear 
problems. Specifically, to illustrate the approach, we initially focus on the comparison of two dG discreti¬ 
sations of the Laplacian which are standard in the literature. We move onto the a posteriori effect that 
quadrature error has on finite element discretisations of a model problem. We show that the estimator can 
be split into components of which some terms can be computed exactly using the same quadrature approx¬ 
imation in the scheme and others requiring further approximation. We also examine unbalanced elliptic 
problems that are nonvariational. We derive a posteriori bounds which are shown to be reliable and 
efficient up to data oscillation terms. We also examine further effects of quadrature approximation in this 
setting and compare with the variational case. 

The paper is set out as follows: In ^we look at an abstract model problem and develop an a posteriori 
Strang type estimate. In we then illustrate how this framework can be utilised for a model problem 
and consider inconsistencies arising from quadrature approximations. In Q we study the application of 
unbalanced elliptic problems, these are nonvariational in nature and we conduct an a posteriori analysis 
of existing schemes proposed for this class of problem. Finally, in ^ we summarise extensive numerical 
experiments testing the robustness of the proposed estimators. 

2. Abstract problem setup 


Throughout this section we develop an abstract a posteriori framework for using notation borrowed from 
[EG041 §2]. To that end let VF be a Banach space and V a reflexive Banach space, equipped with norms 
ll'll^y , Ij-lly respectively. Let £/ :VFxE—^-IRbea bilinear form and Z : F —IR be a linear form. Consider 
the abstract problem to find u G W such that 

(2.1) £/{u,v) = l{v) VvGV. 


2.1. Assumption (The continuous problem is inf-sup stable). Assume that this problem is stable, in the 
sense that for some 7 > 0 


( 2 . 2 ) 


sup 

«6Flhllv<l 


qu) > 7I 


iw ■ 


2.2. Proposition (A priori bound). Under the conditions of Assumption 2.1 we have the following a priori 
bound for the solution of §3): 

(2.3) 

where V* is the dual space ofV. 


W\\yy<C\\l\\y., 


We let Vfi, Wfi be hnite dimensional (not necessarily conforming) spaces. Suppose that 14, Wh are equipped 
with norms IHIvy >IMIwfe respectively and Wh a seminorm Let W{h) = W + Wh and let W{h) be 

equipped with the norm such that 

(2.4) MwhGWh 

(2-5) lklliT(/.) = IkllvT VwGW 

( 2 . 6 ) WwGW 

(2.7) Ikll,., = ||u||^ VuGK 
We consider the Galerkin method to seek Uh G Wh such that 

(2.8) £^h{uh, Vh) = lh{vh) yvhG 14, 

where s^hi', ■) ■ Wh x 14 —t IR is an approximation of •) and lh{-) : 14 —t R is an approximation of Z(-). 

2.3. Assumption (On the Galerkin method). We will assume that the Galerkin scheme is bounded in the 
sense that 


(2.9) 


^h{wh,Vh) < Cb lk/i|lvfe y WhG Wh,Vh G 14- 
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2.4. Remark. Note that the is no assumption on the convergence of the Galerkin scheme, only that the 
problem is stable. 

2.5. Assumption (Conforming finite dimensional space). Suppose another finite dimensional space, W, 
exists and is a conforming approximation of W, i.e., W C 14^ and there exists a reconstruction operator 

E :Wh-^y. 

2.6. Theorem (Strang type a posteriori result). Let e := u — Uh = {u — E{uh)) +{E{uh) — uu) ='■ 
then we have the a posteriori result that for any € Vh 


( 2 . 10 ) 


|e|lwh ^ 


sup [lh(v - Vh) - £/h(uh,v - Vh)] + sup [l(v)-lh(v)] 


7 




+ sup l£/h(E(uh),v) -’e/(£l(uh),v)l 
«eClhlC<i 

= : —(<fi + (02 + (^3 +(7 + Eb) (^ 4 ) ■ 
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+ 1 + 


Cf 


I 


\Wh 


2.7. Remark (Structure of the estimate). It should be noted that the result in Theorem 2.6 is not a posteriori 
computable since it involves taking the supremum over an infinite dimensional space. It is, however, the 
beginnings of an a posteriori bound. The first term represents a standard residual term which can be 
computably estimated using standard techniques |AOOO[ c.f.]. The second and third terms (^ 2,(^3 measure a 
form of inconsistency error and the fourth term, ( 04 , the nonconformity error. 

2.8. Remark (Comparison with the a priori Strang Lemma). Recall the Strang a priori result that if the 
bilinear form associated to the Calerkin method (2.8) satisfies a discrete inf-sup condition, that is 


( 2 . 11 ) 


sup ■s/h{wh,Vh) > Jh 

'UheVh,\\vh\\Y^<l 


then 

( 2 . 12 ) ||e||^y <(l +— ) hii \\u-Wh\\w^ + —( sup Mvh) - £/h{u,Vh)] 

The inconsistency term represents how badly the solution u satisfies the discrete scheme. Theorem |2.6| may 
be reformulated as 


(2.13) 



sup [lh{v - Vh) - ■s^h{uh,v - Vh)] + sup [l{v) - £/{E{uh),v)] 

i’6Clhllv<i i’ecihllv<i 

+ sup [lhiv)-S!/hiE{uh),v)]')+(l + —')\\e^\\ 
vev,\\v\\Y<i ^ 77 


and we may think of the inconsistencies as how badly the reconstruction E(uh) satifies both the smooth 
problem and the Calerkin approximation. 

Proof of Theorem |2.6| Using the stability of the continuous problem from Assumption |2 . 1 1 we have for any 
Vh G Vh 


(2.14) 


e‘^||w<-( sup £/(e'^,v)] =-( sup [^{u,v) - £/{E{uh),v)] 

7 V’'Gy.|hlC<i / 7 Vt'eU||«IC<i 

<-( sup [l{v) - £/{E{uh),v)]] , 

7 V’^eUlhllv<i / 
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using the continuous problem (|2.l|). Adding and subtracting appropriate terms, we have 

\\„c 


I 1 

L<- 


sup 


7 v«Gy,|hlA<i L 


(2.15) 


l{v) - lh{v) +lh{v- Vh) - ^h{uh, V - Vh) 

- £^h{E{uh) -Uh,v)+ £^h{E{uh), v)- £/{E{uh), v) 
( sup [lh{v - Vh) - s^/hiuh, V - Vh)] + sup [l{v) - Ihiy) 


1 

< - 
7 V«GV.|h|lv<l 


+ sup [£/h{E{uh),v) - £/{E{uh),v)]+ sup £/h{E{uh) - Uh,v) 

'veV,\\v\\^<l ■u^V,\\v\\y<l 


in view of the triangle inequality. To conclude we note that 


(2.16) 


sup s/h(.E{uh) - Uh,v) < Cb > 

«6V'.lhllv'<l 


from Assumption 2.3[ (|2.5|) and The result then follows from 

„N\\ 


(2.17) 

concluding the proof. 




< lle'^lliy + lie 


\Wh 


□ 


2.9. Remark (Computability of (4.121). For (4.12) to be useful some assumptions need to be made on the 
reconstruction E. In particular, we impose that 

(2.18) \\E{wh) - Wh\\^r^ < C\wh\y^^^ ywh&Wh- 

Examples of reconstruction operators satisfying this condition for various test problems will be given in 
subsequent chapters. 

2.10. Definition (Formal dual problem). Let L be a Hilbert space such that for any v G V we have 

(2.19) 


\vh<C\\v\\y. 


We define the formal dual (adjoint) problem to (2.1) to be: For any g G L, find z{g) G Z CV such that 
(2.20) £/*{z{g),v) = fL{g,v) MvGV, 

where £/*{v,w) = £/{w,v). 

In addition we suppose that the bilinear form /l(’, •) induces a seminorm on L, i.e., 

hig,-) 


( 2 . 21 ) 


II := sup- 


geL ||<?||y 

2.11. Assumption. In addition to Assumption |2.1[ assume the formal dual problem is well posed, that is, 
for all g G L 

£/*{z{g),v) 


( 2 . 22 ) 

then we immediately have that 
(2.23) 


7* \\z{g)\\z < sup 


vev l|u||y 

l^(ff)IU<:^ll5llL- 


2.12. Theorem (Dual aposteriori bound). Using the same notation as in Theorem 2.6 under Assumptions 

Oand lJID 


l|e||i<^( sup [lhiz{g) - Zh) - £/h{uh,z{g) - Zh)]+ sup [l{z{g)) - lhiz{g))] 

7 ^ sei,llslli,<i sei,llslli,<i 

(2.24) + sup [.s/h{E{uh),z{g)) - £/{E{uh),z{g))]+ sup £/h{E{uh) - Uh, z{g))) 

oGL,l|o|h<l oGi,l|o|h<l ^ 


=: — UE + ST + S’T + rfA) . 
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Proof Proceeding along the same lines as Theorem 2.6 we have that e := u — Uh = (u — E(uh)) + 
{E{uh) — Uh) =■ and using the stability of the dual problem 

(2.25) 

geL,\g\j^<i geL,\g\^<i 

Adding and subtracting appropriate terms gives 


sup /i(e'^,g)= sup £/*{z{g),e'^) = sup £/{e^,z{g)). 


„C\ 


= sup l{z{g))-lhiz{g)) + lh{z{g)-Zh)-s^h{uh,z{g)-Zh) 

geL,\g\^<i 

- szfh{E{uh) - Uh,z{g)) + ^h{E{uh),z{g)) - £/{E{uh), z{g)) 


(2.26) 


< sup 

g&L,\g\^<l L 


sup 

g^L,\g\^<l L 


h{z{g) - Zh) - s^hiuh, z{g) - Zh) 

£/h{E{uh),z{g)) - £/(E(uh),z(g)) 


sup 

9^L,\g\^<l L 


Kzia)) - Ihizig)) 


sup jzfhiE{uh) - Uh,z{g)) 

geL,\g\^<i 


concluding the proof. 


□ 


2.13. Remark (Computability of (2.24)). As in the primal case Theorem 2.12 is not useful in its own 
right without further assumptions, for example one would desire that the finite element space satishes some 
approximability assumption for any v G Z 

(2.27) inf \\v - VhWy^ < Cih°‘ \\v\\z ■ 

vheVh 

The full range of assumptions required for dual a posteriori error control tends to be very problem specific 
as will be illustrated in the sequel. 


3. Applications to discontinuous Galerkin approximations of 2nd order linear self 

ADJOINT PROBLEMS 

In this section we give an illustrative example showing how to apply the framework to derive a poste¬ 
riori bounds for various schemes approximating a model problem. To that end, suppose that the function 
spaces defined in the previous chapter are defined over a domain Q. Let ^ he a, conforming, shape regular 
triangulation of 11, namely, ^ is a finite family of sets such that 

(1) K G implies K is an open simplex (segment for d = 1, triangle for d = 2, tetrahedron for d = 3), 

(2) for any K,JG^we have that AT H J is a full subsimplex (i.e., it is either 0, a vertex, an edge, a 
face, or the whole of K and J) of both K and J and 

( 3 ) [jnesr K = Q. 

The shape regularity of ^ is defined as the number 

(3.1) n{^):=ini 

KeSZ tlK 

where px is the radius of the largest ball contained inside K and hx is the diameter of K. An indexed 
family of triangulations is called shape regular if 

(3.2) /i:=inf/x(5^") >0. 

n 

We use the convention where h : 12 —>■ IR denotes the piecewise constant meshsize function of , i.e., 

(3.3) h{x) := maKhx, 

KBx 

which we shall commonly refer to as h. 

We let S ’ be the skeleton (set of common interfaces) of the triangulation tiZ and say e € ^ if e is on the 
interior of 11 and e S 911 if e lies on the boundary 911 and set he to be the diameter of e. 

We let denote the space of piecewise polynomials of degree k over the triangulation St',i.e., 

(3.4) = {(j) such that (j)\x G P'=(A:)} 
and introduce the finite element space 

(3.5) V ■.= Vo { k ) = P ’’’{ S ') 
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to be the usual space of discontinuous piecewise polynomial functions of degree k. 

3.1. Definition (broken Sobolev spaces, trace spaces). We introduce the broken Sobolev space 
(3.6) Y^\£r) := !</. : e ^\K), for each if S .T} . 

We also make use of functions defined in these broken spaces restricted to the skeleton of the triangulation. 
This requires an appropriate trace space 


(3.7) 


T{<g) := n MdK). 

K&sr 


3.2. Definition (jumps, averages and tensor jumps). We may define average, jump and tensor jump operators 
over T{<S) for arbitrary scalar functions v € T{<S) and vectors v € T((o’)^. 


I-}-- r{<g) 


H: ns) 

V 






L2(^) 

\{v\ki + v\k 2 ) 


on 50 

+v\K2nK2 

1(^^»^) Ian on dVL 

[•1^: [ T {^ udn)r 

V 




V 




H: [ns)r 




(v\ki 0 riifi + v\k2 0 nK2 


Uiv\K^ +v\k 2 ) 

\u|an on dfl 
L2(t?) 

'{v\K,ynK, + {v\K2ynK2 

(tjTn) Ian on dfl 


3.3. Conforming reconstruction operators. A simple, quite general methodology for the construction 
of reconstruction operators with the desirable properties mentioned in Remark |2.9| is to use an averaging 
interpolation operator into an H® (s = 1,2) conforming finite element space. For example the Oswald 
interpolant into a C° Lagrangian finite element space can be used for a conforming operator [KP03] or 
a Hsieh-Clough-Tocher macro element conforming space for conformity iBOblOlirTHVTTl C.f.]. 

3.4. Example (Lowest order H^(f2) and H^(0) reconstructions). An example of the lowest order (p = 1) 
H^(0) reconstruction operator E^{uh) is for given rt/i S V to specify the values of the reconstruction at the 
vertices as an average in a local neighbourhood, as with the Oswald interpolant |EG04j . In general let x be 
a degree of freedom of the conformal space W and let be the set of all elements sharing the degree of 
freedom x then the reconstruction at that specific degree of freedom is given by 


(3.8) 


E{uh){x) 


1 

card(Ara;) 


Uh\K(.x). 


In the case A: = 1 the associated degrees of freedom are illustrated below: 

o o 

H^(fl) reconstruction 

' E^{uh) 

O O O O 


Notice that the degrees of freedom of the reconstruction match that of the original function, as such 
E^{uh) € V, this is because of the existence of a conforming H^(n) subspace in V. For H^(n) conformity this 
is true for any k, for H"^(n) one would need fc > 5 to have access to appropriate elements, as such a richer 
space is required. For k = 2 the Hsieh-Clough-Tocher macro element is used with degrees of freedom 
given below: 
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o 


© 


o 


o 


Uh 

o 


o 


H^(17) reconstruction 


o 


o 


© 


E^^h) 

© 


© 


© 


Note that the reconstruction is no longer an element of V. 

3.5. Definition (H^ and mesh dependant norms). We define the and mesh dependant norms to 
be 

(3-9) ll'*^^ll<iG.2 •“ l©^'*^^llL2(n) + IIllL2(if) 

(3-10) iik/iiiiL,2 := ii[vwhiiiL2(,f) + llMllL(.f) • 

3.6. Lemma (Reconstruction bounds |KP03[ fGHV 11 j b There exist operators : V ^ s = 1, 2 such 


that 


(3.11) 

\\E (ufi) < Ghe |||M;i]||L2(,5') 

(3.12) 

\\E\uh) - < Ch-^ |IK1IIl2(.?) 

and 


(3.13) 

\\E^{uh) - UhllG,2 < + h-^ llKlll 


2 

L2((^) 


)■ 


Proof The proof of the bound for E^(uh) is given in |KP031 Thm 2.2] and for E^{uh) is given in |GHV111 
Lem 3.1]. □ 

o 

3.7. Laplace’s problem. We begin by setting V = W = H^(i7), which is the subspace of H^(r2) with 
functions vanishing on dfl, and let 


(3.14) 


j)= Vu • Wv. 


2.1 


We 


This operator is coercive and thus certainly inf-sup stable in H^(n), thus satisfies Assumption 
consider discontinuous Galerkin approximations with Vh = Wh = 

The interior penalty (IP) method [Arn82] 

(3.15) £/hiuh,Vh)= j yhUh-'^hVh-J • I Vu/j}-I-|t>h] • {{ Vm/i}| n,i] 

o 

is well known to be bounded in the H^(n) like dG-norm ||•||£;(5 2 ■ ^ conforming space W C H^(f7) can 

o 

be found by enforcing continuity of the piecewise polynomial functions, i.e., taking W = PP{^) n H^(n). 
Reconstruction operators E are not unique in this setting. Indeed, one example can be found in |KP03] and 

o 

another the Ritz projection of Uh into n H^(n). 


3.8. Remark (Artificial regularity requirement). The IP method given in (3.15) is only consistent for u € 
jj3/2-i-£^^^ since we require VitJ- to be well defined in order to invoke Strang’s Lemma. This regularity 
is all that is required for the a posteriori analysis based on coercivity given in |KP03| for example but is 
insufficient for the abstract framework devleoped in fj^ for this we would also require v G H^^^''“'^(n) (in the 
evaluation of s/h{uh,v — Vh) for example) which is incompatible with the stability theory. For the bound in 


Theorem 2.6 to make sense it is required that the discrete bilinear form s/h{‘, •): which is well defined over 
(5^) X IP {^), is extended to H^(.5^) x H^(l7), the correct domain to invoke the appropriate stability 
theory used for this class of PDF. 

The extension is not unique. Indeed, one method of extending is to modify the definition of the bilinear 
form for G H^(fF) to 

(3.16) £^h{u,v) = J^'^hU ■ Vhv “ ^(M • I Pfc-i(Vu)S + H • I Pfe_i(Vu)| |u] • |u]^ , 


















where Pk-i denotes the L 2 projection into Note that this is very similar to a proceedure proposed 

in |GILlll §4], however extending the bilinear form in this fashion means that (3.161 coincides with (3.15) 
over V X V, as such (3.16) is the classical IP method interpreted in such a way that a posteriori analysis can 


be conducted using the tools of 

By modifying the bilinear form in this fashion the method becomes inconsistent (in the a priori sense of 


(2.12)), but the inconsistency term is controllable and convergence is optimal. Another method to see this 


is given in [GudlOj . Here the author uses a posteriori techniques and the averaging operator from |KP03) to 
control the inconsistency arising from the lack of regularity up to data oscillation terms. The a posteriori 
bound derived under the framework of is not effected by the inconsistency as we will see in the sequel. 


As the IP method, the Babushka-Zlamal (BZ) method |BZ73) 


(3.17) 


■S^h{uh,Vh)= / '^hUh-'^hVh + / 
Jn Jg 


luhl ■ 


is bounded in the dG-norm, |1 2 ^ however it is inconsistent regardless of the regularity of u, that is the 

second term in (2.12) is always nonzero. It is of optimal order hence, in view of Strang’s Lemma, the method 
converges optimally. Note also that this bilinear form can be trivially extended to H^(^) x 


3.9. Proposition (Approximation properties of L 2 projections [DPEI21 c.f. §5.6.2.2]). Let Pq denote the 
L 2 orthogonal projection into P°(lZ) then for any v G H^(.^) we have the following local approximation 
bounds over elements, K G and faces e G S’: 


(3.18) \\v-Povh^^K)<ChK\\yv\\^^(^^ 

(3.19) |k-Po^|lL,(e) 

3.10. Proposition (Approximation properties of the Scott-Zhang interpolator |EG04I c.f. §1.130]). Let Ij. 

o 

denote the Scott-Zhang interpolant into V n H^(H), then the following local approximation bounds over 
elements, K G fZ and faces e G S' hold: 


(3.20) \\v-Ikv\\^^^i,^<ChK\\yv\\^^^-^ 

(3.21) 


where K denotes a patch of an element K. 

3.11. Lemma (Energy norm a posteriori control). Let Uh be either the IP or BZ approximation of the 
solution of Laplace’s problem, then up to a constant independent of the meshsize both approximations are 
a posteriori controllable by the following bound: 


(3.22) 


\u-uh\\dG,2^^[ + 

XKeS eeg ) 


ith 


(3.23) 


■— 11/ + ^^^>llL2(if) 


2 

L2(e) 


Proof Apply Theorem |2.6[ noting the inconsistency terms for both methods vanish. Then use standard a 
posteriori arguments [AOOOl IKP031 c.f.] for the residual term which we summarise here for completeness. 























Firstly, for the IP method 


(3.24) 


lh{v - Vh) - S!^h{uh,v - Vh) = / f{v-Vh)-'Vuh-{^v-'\/vh)+ / luhj- I Pk-li'^V -Wvh)} 

Ja Jg 

+ j^lv-vh\i Vuh} [m/«1 • 

= [ if + ^Uh)iv - Vh) - [ iVw/i] I - i;/i } 

Jn Jg 

+ J luhj-lPk-ii^v-Wvh)}-^luhj-lv-Vhj 

=: ^1+^2+^3+ ^4- 


Using Cauchy-Schwarz we see 


(3.25) 


?i = / (/ + Auft)(w - Vh) 

Jn 

= 11/+ ^^^llL2(if) 11'^ ~'^^IIl2(a:) 


Kes 


< (7 ||Vz;||L2(n) ^ 11/+ , 

KGgr 


by taking Vh — Pqv, the L 2 orthogonal projection into P°(.;7) and using Proposition 3.9 For the second 
term, using the same Vh 


(3.26) 


^2 = - X! / 1 ^ 1 

eeg''^ 

<EllI^«JllL.(e)ll«^-^'ailL2(e) 

<C^I|Vt^|lL,(n)E^ElllVuJ||L,(,). 




For the third term, in view of Cauchy Schwarz and a trace inequality and the stability of the L 2 (n) projection 


(3.27) 


^3 = E / « Pk-ii^v - WVh) } 

<CEllKlllL2(e)lll^^fc-l(V^^-V^;.)S|lL,(e) 


eeg 

< CIlVul 


L2(f2) 


E^. 


- 1/2 


L2(e) 


and similarly for the final term 


(3.28) 


= E / ^ luhj ■ [i' - 

eeg''<^ 

<E / '^^/MlKlllL2(e) III^'-^'hl|lL2(e) 


eG<f ® 

< CctIIVuI 


Lain) 


E^ 

eeg 
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- 1/2 


Lale) • 



Collecting (3.30)-(3.28) we have 

S'l := sup [Ihiv - Vh) - £i^hiuh,v - Vh)] 


(3.29) 


-^f X/ ^1. 


1/2 


\Kes ees 

yielding the desired result. For the BZ method we note 


(3.30) 


lh{v - Vh) - £^h{uh,v - Vh) = / f{v-Vh)-'^Uh-{'^v-\7vh)-^luhl-p^v-\7vhj 

Jn “ 

= / (/ + Au;,)(u - Uh) - [ [Vu/j] I U - U/il 
dn Jg 


=: ^1 + ^2 + ^3 + ^4- 

Notice that ^ 1,^2 and ^4 are the same terms that appear in the analysis of the IP method above. The 
third term is different due to the nature of the inconsistency of the scheme. This term vanishes as soon as 
Vh is chosen as a continuous function. It suffi ces therefore to pick Vh as the Scott-Zhang interpolator into 

o 

V nH^(n) and making use of Proposition 3.10 The bounds for the other terms follow analogously as above, 
concluding the proof. □ 

3.12. Remark (Lower bounds). It can be proven that the a posteriori bounds for both IP and BZ methods 
satifsy lower bounds to the error, that is, using the notation of Lemma |3.11| 

(3A1) i + — 11'“ ~'*^^lldG,2 + 11/~-^fc/llL2(n) ■ 


\K&sr 




The last term in (3.31) is called a data oscillation term. See |KP07| for a proof of this fact for the IP method. 
The arguement for the BZ method is identical since the a posteriori bound is the same. We will examine 
how other forms of inconsistency effect lower bounds in the sequel. 

3.13. Remark (Dual norm estimates). Obtaining optimal dual norm estimates is slightly more delicate than 
the primal ones. Suppose the method is consistent, and in particular adjoint consistent |ABCM02] . that is 
if z S H^(n) solves the dual problem —Az = g then 


£/h{v,z)= f vg VuSH^).^). 

Jn 


The IP method satisfies this condition and optimal a posteriori bounds can be obtained in the dual norm 
quite simply under the framework of Theorem |2.12| as the following Lemma illustrates: 

3.14. Lemma (Adjoint consistent dual norm a posteriori control). Let Uh he the IP approximation of the 
solution of Laplace’s problem, then up to a constant independent of the meshsize the following a posteriori 
bound holds: 


(3.32) 
with 

(3.33) 


XKesr ees ) 


1/2 


hl,R -■= \\f + AuhWl^^ii^ 

vIj ■= hi WWuhMl^ie) + <^he 


2 

L2(e) ■ 
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Proof To control the nonconformity term arising in the dual a posteriori bound (to’/' in (2.24|) we note 


(3.34) 


Hence 


■^h{E'^{uh) - Uh,z) = / - Uh 

JQ 

= !-{E^{uh) - Uh) IS.Z. 

JQ 


^5= sup £/h{E^iuh) - Uh, z) 
96i.l9L<l 


Vz- 


Iw/^l 


Vz' 


<1 JQ 


(3.35) 


= sup 

geL2(n),||g||L2(ri)i 

< sup 

gGL2(n),||3||L2(ri)<l 

< — \\E'^{uh) - Uh\ 

" ' 


[ -{E^{uh) - Uh) Az 
Jn 

E (u/j) — II Az||l^(q^ 


L2(n) 


<ChJ Ill'U'ft.l|I l2(^) • 

The bounds for the other terms follow from standard a posteriori arguments and can be derived by mimicking 
the proof of Lemma |3.11| and using regularity bounds for the solution of the dual problem. □ 

3.15. Remark (Dual bounds do not require reconstructions). To construct the dual a posteriori bound in 
L 2 for adjoint consistent methods it is not necessary to use the reconstruction approach. The numerical 
approximation is smooth enough to directly apply the stability theory of the dual problem directly, at least 
in the case of Laplacian and biharmonic operators |GV13) . 

3.16. Remark (Dual norm estimates for inconsistent methods). For adjoint inconsistent methods, i.e., 
those not satisfying the condition in Remark 3.13 to obtain optimal dual norm a posteriori estimates 
over-penalisation is necessary, as in the a priori case [ABCMO^ §5.2]. We consider the over-penalised BZ 
method with 


(3.36) 


s^h{uh,Vh)= / ^hUh-^hVhE / luhl ■ Ivhj , 

Jn Jg 


for ,5 > 1 and define the over-penalised norm as 

(3-37) ■“ II'^^'^'»IIl2(0) + ^ II llL2((f) • 

Optimal a posteriori bounds can be obtained for the over-penalised BZ method with /3 large enough under 
the framework of Theorem 2.12 as the following Lemma illustrates: 

3.17. Lemma (Adjoint inconsistent dual norm a posteriori control). Let Uh he the over-penalised BZ approx¬ 
imation of the solution of Laplace’s problem, then for P > 3 up to a constant independent of the meshsize 
the following a posteriori bound holds: 


(3.38) 
with 

(3.39) 




1/2 


'’lo,R ■— 11/ -I- 

vIj ■= hi |IIVWhl||L2(e) IIKlllL2(e) ' 

Proof The proof is analogous to Lemma |3. 14| with the exception of the nonconforming term, here 













Following the arguments of [ABCMO^ §5.2] we have that 

[ lE\u^) - u,] . I Vz} < Ch-^/^ II lE\un) - ||l,(^) |1| 

< c lls^M - u,||~ ^ I1Az||l,(o) 


(3.41) 


via a trace inequality and the definition of the over-penalised dG norm (3.37). Now through an inverse 
inequality we see 


(3.42) 
and hence 


[ lE\uh) - • S VzS< Ch(f-3)/2 ||£;l(u;,) - U;,| 

J s 


U{n) 


IIAzll 


L2(a) 


S’ 5 = sup s^h{E^{uh) - Uh,z) 

9 &L,\g\^<l 


sup 

gGL2(0).||g||L2(n)<l O. 


(3.43) 


< 


< 


[ -[E^{uh) - Uh) Az+ [ lE^iuh) - u/i] • I Vz} 
JQ Jg 

II A^||L2(n) 


sup C(l + \\E^{uh) - (H) 

9eL2(n).||glh2(m^i 


^(l + 


,(/ 3 - 3)/2 


7* 

<^^^2II Kill 

for ^ > 3, concluding the proof. 


\E^{uh) ''^/i||L2(n) 


L2 {g) 


□ 


3.18. Quadrature approximation. Perhaps the first topic of thought when considering notions of in¬ 

consistency in finite element analysis is that of quadrature approximations. In the case when the bilinear 
form contains a positive definite, non constant diffusion tensor inconsistencies can arise from the inexact 
integration |Cia781 §4.1]. Take A € and 

(3.44) s^(u,v) = I (AVu) • Vu, 

Jn 

for example, then both linear and bilinear forms are practically approximated by using quadratures. 

3.19. Definition (Quadrature). We introduce 

(3.45) Qk(/) := E 

iq — 1 

to be a quadrature where denote the positions and weights of the quadrature points over an 

element K. This is said to be an order s quadrature if 


(3.46) [ f = QW) V/GP^(iF) 

JK 

and 


(3.47) 


f{x) dx-Qj^{f) 


fK 


< l/lH'’(iC) 


V/GH*(7^). 


Let us, for sim plicity, temporarily consider the C° conforming method under quadrature approximation 

o 

applied to (3.44) which reads: Find u;, G V n H^(n) such that 


(3.48) 


E g^((AVuft) • Vvu) = E QW^h) Vu;, G Vn R\n). 

K^sr K^sr 
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3.20. Remark (Standard quadrature degree choice). A relatively standard choice of quadrature degree is 
s = 2k—2. This is because if A is constant then the quadrature allows for exact evaluation of the bilinear form. 
The linear form however is not, in general, integrated exactly. Indeed, the linear form can be interpreted as 

(3.49) h{vh) = Y. = E / P^-2fvh, 

where Pk -2 denotes the L 2 projection into pmax(fe- 2 .o)^^^_ 


3.21. Lemma (A posteriori bound for quadrature approximations). Let u S H^(r2) be a weak solution to 
the variational problem 


(3.50) 


{u,v) = l{v) VuGH^(n), 


with £/{u,v) given by (3.44) and Uh solve {3.48) with quadrature degree 2k — 2, then 

/ \ 1/2 


(3.51) 


where 


(3.52) 

Vi 


Vj 


||m — — P' ( E + E I ’ 

\Ke3r e^S j 

= hK \\Pk-2f + div{Pk-i{A\/uh))\\Yii) 

= hi 11/ - Pfe-2/|lL(K) + IK^^fc-l - Id)(AVu;,)||^^(^) 
= K\\lPu-l{A^UhmY,y 


3.22. Remark (Evaluation of the estimator and data oscillation). The indicator components rjji, rjj have the 
standard form of an a posteriori estimator with the exception that they involve projections of the problem 
data. This allows for these terms to be evaluated exactly using the same quadrature as used in the scheme, 
that is, there is no additional approximation involved in the computation of these terms. 

The estimator rji, representing the inconsistency, is a data oscillation term. In general, the integration 
required in computation of the estimator cannot be performed exactly using the quadrature degree in the 
scheme. If some smoothness is assumed on the problem data these terms can be considered of higher order 
in comparison to the residual, however in practical computations resolution of the problem data is extremely 
important especially at coarse mesh scales, see |BDN13) . In the design of convergent adaptive algorithms 
the inclusion of these terms in the marking strategy is of paramount importance see jMNS021 IMN05| for 
further arguments. 


Proof of Lemma |3.21| Since the method we consider is conforming Theorem |2.6| simplifies to give 

||Vu-< -( sup [lh{v - Vh) - £^h{uh,v - Vh)]+ sup [l{v) - lh{v)] 

^ ' 'T \ o o 


uGHM|d||o <1 


«GHM|«||o <1 


(3.53) 


+ sup [£/h{uh,v) - £/{Uh,v)] j 

o / 


<1 
h1 

=: —{(S’l + (02 + <^ 3 ) ■ 

7 


The first term is a standard residual term and can be controlled using standard a posteriori techniques. 

lh{v-Vh)= E / Pk- 2 f{v - Vh) . 


Indeed, using (3.49) we interpret 

(3.54) 

Similarly, 

(3.55) s!/h{uh,v - Vh) = E / Pk-i{AVuh)'V{v - Vh). 

or JK 




IK 
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Hence 


(3.56) 


lh{v - Vh) - £^h{uh,v - Vh) = ^ / {Pk- 2 f + div{Pk-i{A\7uh))){v - Vh) 

Kes 

- Pk-i{AVuh) ■ n{v - Vh) 

JdK J 

= ^ f {Pk- 2 f + d^'v{Pk-iiAVuh)))iv - Vh) 

Ke£r 

- [ lPk-i{AVuh)l{v - Vh). 

Jg 

Taking Vh to be the Scott-Zhang interpolant of v and using the approximability properties of Proposition 
13.101 we have 

(3.57) <S’i<C ||Pfe_ 2 / + div(Pfe_i(AVM,,))||L^(^) + ^ hy2|||p^_^(AVu,,)]||L^(^)^. 


Kesr 

As for the other term, note that 

^2 = sup [l{v) - Ihiv)] 
■uGHi,Iloilo <1 


eGK 


(3.58) 


sup / “ Pk-2f) V 

dGH1,||j;||cj <1 ^ 

h1 

sup [ {f - Pk- 2 f){v - Pk- 2 V) 

. Jn 


'»eHM|j;||o <1 ^ 

h1 

< ^ sup ^ ||/-Pfc_2/||L,(iC) ||u-P/c-2u||l,(k) 

•uGHI.I loilo 
h1 

<C ^ h;G||/-Pfc_2/|!L,(^), 

Kes 


using the approximability of the L 2 projection operator from Proposition 3.9 
The final bound follows from definition since 

^ 3 = sup [£/h{uh,v) - £/(uh,v)] 
■uGHMIuIIo <1 


(3.59) 


sup Y I iPk-i-^d){AVuh)Vv. 
uGHi.Iloilo <iKe3r' 


'K 


< Y m-i-id)iAvuk)\\ U{K) y 

Kegr 


by Cauchy-Schwarz. Collecting (3.57), (3.58) and (3.59) yields the desired result. 


□ 


3.23. Proposition (A posteriori lower bound under quadrature approximation). Let u, Uh and rjn^rij be as 
in Lemma 13.211 then 

(3.60) 


m + + hi) 1 


eGiC 


where K denotes the set of all elements sharing a common edge with K. 

Proof The proof of this fact is relatively standard, we will include the first part, that of the element 
residual, for completeness and to compare with a result in a later section. Throughout this proof we will 
use the convention that a means a < Ch where C is a generic constant that may depend on the problem 
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data, but is independent of the meshsize u and Uh- We begin by defining the bubble functions. Let bx be the 
interior bubble function defined on the reference triangle iLref with barycentric coordinates {Ai}^,^g through 
={d+ n?=o Due to [Ver96| these functions satisfy 

(3-61) \\bKv\\^^(^x) < ^ \\bKv\\i^^(^x) for ^ ^ 

In view of this we have that 

(3.62) \\Pk-2f + diyiPk-iiAVunml^^x) ^ 6^/"(Pfc-2/+ div(Pfe_i(AVu^)))"" 


L 2 iK) 


SO 

(3.63) 


\\Pk-2f + diYiPk-i{AVuhml,(K) ^ f iPk-2f - f)bKiPk-2f + dW{Pk-i{AVuh))) 

J K 

+ [ (/+ div(zlVuh)) 6if(Pfe_2/+ div(Pfe_i(AVu/i))) 

J K 

+ [ {div{Pk-i{AVuh)) - div{AVuh))bKiPk-2f + diy{Pk-i{AVuh))) 
J K 

< [ {Pk-2f - f)bK{Pk-2f + dW{Pk-i{AVuh))) 

J K 

+ f {AVuh-AVu)-VibK{Pk- 2 f + div{Pk-iiAVuh)))) 

J K 

+ [ {Pk-i{AVuh) - AVuh) ■VibKiPk-2f + div{Pk-i{AVuh)))), 

J K 

as bx vanishes on dK. Now using Cauchy-Schwarz, further properties of bx and inverse inequalities 

\\Pk- 2 f + div(Pfe_i(AVu,))||L,(^) < \\Pk- 2 f - /|Il,(k) + IIAVu - AVu^||l,(^) 

||AVit;j — Pfe_i(AVu^)||L^/j^, 

(3.64) 

^ \\Pk-2f ~ /llL2(i<') d~ b-x ||Vu — ^'Uh\\-^^(^x) 

+ ||AVu/i — Pk—i{A\ 7 uh)\\i^^(^x) ■ 

To see that the jump residual behaves in a similar way one may follow the arguments in |EG04l Proof of 
Thm 10.10] with a slight modification to take into account the data approximation. □ 

The extension to a nonconforming method is almost immediate using the reconstruction operator from 
[KP03] and the arguments previously. For instance, consider the problem to seek Uh G y such that 

(3.65) ■s/h{uh,Vh) = Ihivh) Vu/^eV, 
where 


■s/hiuh,Vh) := ^ Q^j^-\{AWuh) ■ ^Vh) - ^ 

(3.66) 

+ Q 


Qf-\lukj-iAVvnl) 


2k-l 


AVu,*}) - -Qg (M • |uj) 


and lh{vh) is given by (3.49). This is an example of an IP method after quadrature applied to (3.44). Notice 
the difference in quadrature degree chosen to integrate over edges. In addition note the similarity in structure 
to that of (3.16). 


3.24. Corollary (A posteriori bound for nonconforming schemes including quadrature approximation). Let 

\ 1/2 


u be a weak solution of (3.44) and Uh solve (3.65), then 


\U - Uh\ 


dG, 2 ^ hR+'ni^vj 


\Kesr 

15 


ees 


(3.67) 



















where 


(3.68) 

~2 

hi 


Vj 


Also for a large enough 
(3.69) 


= \\Pk- 2 f + 

11/ 2 /IIL 2 (K) + \\iP,.,-ld){AVE\u,))\\l^^^ 

= he \\lPk-l{A'^Uh)l\\l^^e) + IIKlllL2(e) ' 

< (^(11 Vm - Vu/i||l,(k) + m) 

eGK 


with 

(3.70) 77 I := hj, 11/ - Pk-2f\\l^K) + WiPk-i - Id)(AVw,)||^^(^) 

Proof The main difference between control of the conforming approximation and the nonconforming is that 
of the penalty terms. To see these control the error from below we modify the arguement given in |DPE121 
Lem 5.30]. The main idea is that for variational second order problems there is a conformal reconstruction 
in the dG space. This allows us to make use of the numerical scheme in the following fashion. We interpret 


(3.66) as 


(3.71) 


J^h{uh,Vh) = / Pk-i{AVhUh) ■ VhVh 
Jn 


tg L 


(uhl I Pk-iiAVvh)} + I Pk-i{AVuh)} -(The ■ HI 


which, using the fact that Uh solves (3.65), allows us to choose Vh = Uh — E^{uh). From this we see, since 
|if^(u/i)] = 0 , that after an integration by parts 

(3.72) 

= / {Pk- 2 f+ dhi{Pk-i{A\Iuh))){uh-E^{uh)) 

Jn 


Ig L 


lPk-iiAVuh)j ■ I Uh- E\uh)}- K1 • I Pk-i{AV{uh - E\uh))) } 
< ||Pfe-2/ + div(Pfc_i(AVu,i))||L,(n) hh - E\uh)\\j^^^^) 

+ WlPk-iiAVuhMu^g) IIS Uh - E^UhnW^^^^^ 

+ IIKIIIl.(^) III Pk-i{AV{uh - E^{uh))) }||l,(,.) 

=: J1 + J2 + Js- 

The first term, owing to the properties of the reconstruction E^{uh) is controlled by 
/i = \\Pk-2f + div(Pfe_i(AVMft))||L,^(s^) \\uh - E\uh)\\^^^^^ 

< ChH|lPfc_2/ + div(Pfc_i(AVu;,))||L,(0) IIMIIl.(^)- 

The second term, via a trace trace inequality 

^2 = ||[Pfc-i(AVu„)l||L,(^) III Uh-E\uh)}\\^^^^^ 

(3.74) < Ch-P^ ||[Pfc_i(AVu^)l|lL,(^) \\uh - E\uh)\\^^^^^ 

<C\\lPk-i{AVuhmL.ig)\\luhlh,ig)- 

Finally the third term, again by a trace inequality and the stability of the L 2 projection 

,/3 = IIK1I1l.(^) III -p 1 (u„)))}||l^(^) 

< Ch-P^ IIK1||l,(,.) ||Pfc-i {A\7{uh - E\uh)))\\^^^^^ 

< Ch-P^ ||MllL,(,f) ll^vH - E\uh))\\^^^^^ 

— Ehe ^ IlIw/illiL^C^) ■ 


(3.75) 
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Substituting ( |3.73[ ), ( |3.74 ) and (3.75) into (3.72) yields 

(3.76) IIKlliL(^) ^ C(||Pfc_2/ + div(Pfc_i(AVu;,))||L,(^) + ||[Pfe-l(AVu,)lllL.K)) ■ 

Hence for a > C the penalty term is controlled by the other residual terms which in turn, using the 
arguements from Proposition 13.23} are controlled by the error up to the data oscillation terms given by iji, 
concluding the proof. □ 

3.25. Remark (Inconsistencies require computation of reconstructions). In the statement of the bound for 
the nonconforming approximation of the Laplacian in Lemma |3.11| there is no requirement to compute the 
reconstruction operator, E^. Indeed, if there is no inconsistency in the approximation of the nonconstant 
diffusion problem (3.44) then it is still not necessary to actually compute the reconstruction operator. The 
nature of the inconsistency term ffi appearing in Corollary 3.24 requires computation of the reconstruction 
of Ufi- We will see in a later section that inconsistencies do not always require computation of this operator. 

4. Unbalanced variational and nonvariational problems 

In this section we will take a different route. For simplicity initially we shall ignore any possible effect 
of quadrature and focus on the effects of inconsistencies in unbalanced problems. We will consider the case 
W = H^(n) n Hi(U) and V = L 2 (fl). Let 


(4.1) 


s^{u,v) = / A:D^u, 
Jn 


where A G is positive definite diffusion tensor, denotes the Hessian matrix of u and X:Y = 

trace(X^Y') is the Frobenious product between matrices. The operator (4.1) is in nondivergence form. 
Galerkin schemes falling into the framework presented here are given in [LPlll IDP13] . Both are noncon¬ 
forming, inconsistent approximation schemes. 

4.1. Assumption (Strong solutions exist). Under either smoothness assumptions on A [GT83| . Gampanato 
[Cam94j or Cordes |Cor61| conditions on A the problem 


(4.2) 


JQ 


fv 


has a strong solution, that is, there exists a constant C > 0 such that 

(4-3) ll^llH2(n) ^ C" ll/llL2(a) • 

This is equivalent to Assumption |2.1| Note that if A = —I then this is a natural framework to study the 
Laplacian if it has strong solutions. 

4.2. Consistent nonvariational methods. Henceforth we will take k > 2. For A G Loo(U)‘^^‘^ satisfying 
Assumption |4.1|we define 


(4.4) £/h{uh,Vh) = / A-.Ei^Uh Vh- ([Vwh],g,:| AvhJ - ahe |Vu/i] iVvhj - crh^ ^ lu/i] • |u/i|) 
Jfi Js 

and seek Uh GV such that 


(4.5) 


^h{uh,Vh)= / fvh yvhG 


Note for u G H^(il) the method is consistent, in that 


(4.6) 


^h{u,v) = s^/{u,v) VueH^(fr). 


4.3. Remark (Extension of the bilinear form). The discrete bilinear form (4.4) only makes sense over 
H^(f5^) X H^(f7). To make use of the a posteriori framework in (j^we require an extension to ensure the 
appropriate stability arguemnts can be applied. That is we require the bilinear form to be extended to 
H^(f5^) X L 2 (fl). To do this for(M,u) G H^(f5^) x L2(S1) we define 

s^h{u,v)= [ A-.Bluv- [ (|Vu]^:-8; APfeu}-cr/ie[Vu]|V(Pfcu)]-crh;:^ H • |Pfcu]) . 

Jn Jg 
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(4.7) 





























Notice that the modified bilinear form (4.7) coincides with (4.4) over V x V and that it satisfies 


(4.8) 


s^h{u,v) < C'|||M|||dG .2 IMlL 2 (n) for {u,v) e x L 2 (n)) . 


4.4. Theorem (Primal consistent a posteriori upper bound). Let u S H^(r2) solve {4.2) and 
finite element approximation given by [4.5), then 


(4.9) 


\Kesr 


1/2 


\\u-Uhl\dG,2<ci Vr + Y^J 




where 


(4.10) 

(4.11) 


Vr \\f 

Vj ■= IIIVw/*lllL(e) + IIKlllL(e) • 


Proof Making use of the modified bilinear form given in Remark 4.3 we may apply the framework 
2 . 6 | directly here and we have 


||e||H 2 (n) ^ sup [lh{v - Vh) - ■s^h{uh,v - Vh)] + sup [^(^')- 

7 ^ i'6L2(n),||«||L2(n)<l «6L2(f2),|fi|lL2(n)<l 

(4.12) + sup [£/h{E‘^iuh),v) - S!/{E‘^{uh),v)]')+(l+—') \\ 

=: —(#1 + (?2 + <^3 +(7 + Eb ) <^ 4 ) ■ 

7 


Note that ^2 = 0 and since E^{uh) G H^(n) all jump terms from (4.7) vanish and we see (03 = 0 


(ai we see in view of Cauchy-Schwarz 


(4.13) 


S’l= sup [lh{v - Vh) - £/h{uh,V - Vh)] 

j;GL 2 (n),||?;||L 2 (fi)<l 


sup 

i;GL2(n),||?;||L2(r!)<l 4 


{f - A:Dluh){v-Vh) + f |VuJ^:| 24 Pfc(u - w/j)} 

Js' 

-ahe [ [Vu/,] IV(Pkiv - Vh))l - crh~^ ( {uhl ■ {Pkiv - Vh)l 
Jg Jg 


< 


sup 




’'SL2(0),||d||l2(s2)<1 ^ xeS 

+ ( III^“JllL2(e)(ll{{ - 4 Pfe(u-U,i)}||L,(e) + ^^eWlV {Pkiv - 


eeg 


■Crh/ |||u4|lL,(e) III-Pfc(^^-^'h)lllL2(e) 


G V be the 


of Theorem 


lh{v)] 


e^ll 

^ llH2(n) 


. To control 
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Now choosing Vh = 0, using trace and inverse inequalities and the stability of Pk in L 2 we have 


S‘i< sup ( \\f - A:Dluh 

t)GL2(n),||ti||L2(n)<l \ 


L2(iC) ll‘'llL2(iC) 

+ CY{ ll^fc(^)llL2(iC) + W^PkvW U{K) 


e£K 

IIKlllL,rel h-Y"\\PkV\\ 


L2(-ft:) 


(4.14) 


< 


sup 


llL2(r2) 


........... Y. 

?jGL2(n),||f||L2(n)<l V 

+ c^ (chY^\\lVu4\\^^^,) + h-Y^\\lunl\ 

eeK ^ 


UiK) 


L2(e) 


Kesr \ eGK ^ 


L2(e) 


□ 


To conclude we use the reconstruction bounds given in Lemma 3.6 to control S '4 as required. 

4.5. Proposition (Lower bounds). Under the conditions of Theorem 4.4 the following global a posteriori 
lower bound holds: 


(4.15) 

where 

(4.16) 

(4.17) 


Y hR + Y'^j- “ ^hUh\ 

Kesr ecs 


lL2(a) 


hR ■= \\f - A-.D'^Uh 

Vj ■= WV^UhlWYe) +h 


h2iK) 

-3 HIT.,. Hl|2 

Lale) • 


Proof The bound on the interior residual is trivial since the error and residual are being measured in the 
same norm there is no need to use bubble functions to invoke inverse inequalities. Indeed, 

(4.18) 


|/ ILain) — ll^llLoo(n) 11^ P^h'^h\ 


Lain) 


For the jump terms we may use a similar argument to that given in the proof of Corollary |3.24 We choose 
Vh = Uh — E^{uh) in (4.4) and then by definition of the scheme 

(4.19) 


^ IIKl|lL,(e) < ^ IIIVuJIIl2(^) II - ys^iuh)} ||{{ Uh - E^iuh)}\ 

+ ||/ ~ ^•D/iU/t||L2(Q) W^h — E (w;i)||L2(Q) ■ 


L 2 (S’) 


Now using the properties of E^ from Lemma 3.6 we have 

(4.20) ah-^ IIK1IIl2(<?) ^ c'^III^wJIIl2(-?) IIK1IIl 2(^) + ll/- ^-D^^/illLaiif) 
Now using the fact that 

(4-21) K^muh}\\Yg^=h-^mu-yuh}\\Y^^ < p^u-Uluh\\Y^^ 


L2(<^) 


together with (4.18) yields 

(4.22) hp^\\luhl\Ys)+hp^\\lVuhl\Y^pCp^^ 

concluding the proof. 


Lain) 


□ 
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4.6. Inconsistent nonvariational methods. We now introduce an inconsistent method for the approxi¬ 
mation of (4.1). This is a slight modification of that presented in [DP 13) and related to [LPllj . The method 
we consider is given as follows: Let H{uh) € be such that for all ^ € V 

(4.23) [ H{uhH= [ ^luh^- [ 

Jn JQ Jg 

then we seek the tuple {uh, H{uh)) such that 

(4.24) ■s^h{uh,Vh)= [ fvh Vd/i G Vn H^(.3^), 

JQ 

with 

(4.25) ^h{uh:Vh) := [ A:H{uh)vh-\- [ crhelVuhjlVvhj[ ah~'^ {uh} - {vh} • 

Jn Js JS 


4.7. Theorem (Stability of H{-) |Pryl4[ Thm 4.11]). Let H{-) he defined as in (4.23), then it is stable in 
the sense that 

(4.26) IlD^z;^ - ^ I Wl' 

Consequently we have 

(4.27) \\H{vn)\\l^^y..<C\lvf.\lla,2- 


4.8. Theorem (Primal a posteriori error upper bound). Let u G H'^(f2) solve (4.2) and ut be the finite 
element approximation given by (4.24), then 

(4.28) 


1/2 


u-M;i||UG.2 < c Vr + Y^J 


where 

(4.29) 

(4.30) 


Kegr eeg 

4 := \\f-A:Hiun)\\Y^^ 

2 , 7-3 HIT.. ni|2 




Proof The proof is similar to that of Theorem |4.4[ Using the abstract result of Theorem |2.6| we proceed to 
bound each term. In view of Cauchy-Schwartz we have 



#1 = 

sup 

/ 

- A-.H{uh)){v- 

- Vh) 

(4.31) 


i;GL2(0).|b||L2(fj)<l 

JQ. 




< 

sup 

E 

\\f - A-.H{uh)\\, 

.■2{K) 



i;GL2(0).||d||l2(sj)<1 

Kesr 


Choosing Vh 

= 0 we have 





(4.32) 


#1 < C 

El 

\f - A-.H{uh)\\^ 

2(K) 


Kegr 

The consistency term #2 = 0. For the second consistency term we have that 
#3= sup - f A:{H(E^Uh))-D^E^Uh))v 

i;GL2(n),||77||L2(n)<l O' -'O 

<C'll^llL„(a) E \\HiE^M)-B^E^iuH)\Y^^ 


(4.33) 


Kes 


Kesr 


<C|l^llL^(a) E [\\H{E\un))-H{un)Y^^^ + \\H{uu)-T4lunY^^^ 












Making use of Lemmata |4.7| and |3.6[ we see that 

(4.34) 


K^sr 


< c- (^ h-^ II [V.uj||^^(,) + h-^ IIK11|^^(,) 

\e&S / 


1/2 


Again from Lemma |4.7| we have that 
(4.35) 


K^sr 

and Lemma [3)6| gives that 


^ ||ff(zi,) - < C ( ^ II iV.uJ|1^^(,) + h-3 

'<&sr Kees 


1/2 


2 

L2(e 


(4.36) 5] <C E^^'llI^'‘^JIlL(e)+*e-'llWllL(e 


1/2 


KeS 

Hence we have that 


KeeS 


1/2 


(4.37) 


VeG^ 


-3 


L2(e) 


We conclude by noting that from Lemma 3.6 

(4.38) A = \lEiuh) - un\UG.2 < c( ^ ||[V,i;J||^^(,) + h, 


1/2 


-3 


L2(e) 


\eGK 


Collecting (4.32), ( |4.37| ) and (4.38) yields the desired result. 


□ 


4.9. Proposition (Primal a posteriori lower bound). Using the notation of Theorem 4.8 we have for a large 
enough a lower bound of the form: 

^2„ 


(4.39) 


lL2(a) 


)■ 


y] m + '^hj < c(||d^m - P^M/ii 

Ke^ ees 

Proof Notice, as in the consistent case there is no data oscillation, this is since the inconsistency can be 
linked to the nonconformity via Lemma |4.7| We begin by noting 

11/ - A:fl'(u,,)||L^(j^) < II A:(d2u - + || A:(D^w,, - H(uh)) ||l^(j^) 

(4.40) < C(||d2u - + IlD^ii;, - Jf(u^)||j^^^^^) 

< cdlD^zi - D^,||^^(^^ + h-y/^ ||lV«,l|iL,(,,) + h-y/^ ||MIIl2K)) ’ 

in view of Lemma 4.7 Now, as in the proof of Proposition 4.5 making use of the scheme (4.24) with 
Vh = Uh- E^{uh) we see 

(4.41) 

IIKlliL(e) < IIIVuJ|Il 2 (^) - vi?'(«.)l||L,(^) 

+ 11/ - hh - ^^K)||L2(n) 

<C||[Vu4|l 

L 2 (S’) II Kill L 2 {S’) 

+ Ch^''^ IIK1 llL2(,f) (II llL(^) + II w llL(^)) • 

Rearranging the inequality we have 

(4.42) (a - C) h/3/2 ||K1|1 l,(,) < Ch^ ||IVM|1l,(^) + C . 

Choosing a large enough and using (4.21) concludes the proof. □ 
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4.10. Corollary (A posteriori bounds for the nonvariational problem including data approximation). Let the 
conditions of Theorem 4.8 hold. Let Uh now be the the solution of (4.24) with the following approximations 


of the bilinear and linear forms 

(4.43) s/h{uh,Vh) ■.= ^ Q^^~'^{A-.H{uh)vh) + '^Ql'"{uhe\^Uhl\^Vhl+(jh~^luhl-lvhl) 


Ke3r 




(4.44) 
then 

(4.45) 

and for a large enough 

(4.46) 

where 

(4.47) 

(4.48) 

(4.49) 

(4.50) 


1 {vh):= Y. QT\fvh), 


KGS' 


|||m - Uh\ldG,2 < C Y Vr + Vi^ + Y^J 
KKgST eGS / 


1/2 




Kesr 


eGS 


kgst 


:= \\Pk-2f - Pk-2{A:H{udmYj,) 
pj := \\{Pu-2 - ld){A-.H{uu))\\YK) + 11 / - Pk-2f\\YK) 


if/ := ||(Pfe _2 - ld){A-.H{E\un))) + 11/ - Pk- 2 f\f 


hj-=K +h 


L2(if) 

-3 I|ir„,. H||2 

L2(e) 


L2(a:) 


Proof The proof is the same as that of Theorem |4.8| with the exception that the inconsistency term is 
slightly more complicated. We have that 


S's — 


(4.51) 


sup - [ Pk -2 {A:H{E'^{uh))) V - A:I)'^E'^{uh)v 

{Pk- 2 {A:H{E\uh))) V - A:H{E^{uh))v) 

+ {A-.H{E^{uh))v - A-.D^E\uh)v) 


i;GL2(n),||«||L2(n)< 
sup 

«GL2(n),||i>||L2(f!)<i f “'n L 


The second term in (4.511 can be bounded using the same arguments as in the proof of Theorem 4.8 The 
other term is controllable by pi using Cauchy-Schwarz, concluding the proof. □ 

4.11. Remark (Comparison of the variational and nonvariational estimates). Note the different structures 
of the data approximation terms appearing in Lemma 3.21| and Corollary |4.10| They are different orders 
of approximation even though the underlying quadrature approximation is the same. This is because the 
estimators control the error in different norms due to the different stability frameworks of the underlying 
PDEs. The data approximation terms are of the same order of accuracy as the other residual terms using 
a quadrature of degree 2k — 2. Higher order quadrature choice naturally results in the data approximation 
terms becoming higher order, assuming smooth data. 

For the nonvariational scheme the inconsistency is in two distinct parts. The first being the nature of the 
scheme, which was tackled in Theorem |4.8| and is inherently tied to the nonconforming nature of the scheme. 
The second is the quadrature approximations. This inconsistency cannot be linked to the nonconformity of 
the scheme, thus another quantity enters into the estimator. 

4.12. Corollary (H^ a posteriori bounds for the Laplacian). Let A = —I then A:TP'u = —Am then the 
following a posteriori bound holds for Uh the approximation given by the IP method under quadrature 


approximation (3.66) 


(4.52) 


\\u-Uh\\dG,2<ciY'4R + hl + Y"^'^^ 
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where 

(4.53) 

(4.54) 

(4.55) 
and 

(4.56) 


Vr ■— \\Pk-2f + ^^^?i|Il2(a:) 
:= \\f - Pk- 2 f\\l^^K) 


r,j :=h-^\\lVunl\\l^,)+h 


-3 


2 

L2(e) 


E 

Kes 


r]R + J2'nJ< C[\\D^u - + 11/ - i^fc-2/|lL,(n)) 


eetg 


Proof The Proof consists of observing that the finite element method (4.24) coincides with the IP method 
(3.66) (albeit with an additional penalty term for the gradients) whenever A is (.^-wise) constant and 

□ 


applying Theorems |4.8| and 4.9 


4.13. Remark (Dual a posteriori error control and the formal dual of (4.1)). The formal dual problem is 

(4.57) £/*{u,v)= I uD^ivA) 

Jn 

and, to the authors knowledge, there is no general existence result for this class of problem. Under the 
stringent assumption that the dual problem is well posed, if Uh solves the nonvariational Galerkin method 
and u is a strong solution then 


(4.58) 


where 


l«-«/illL2(n) < C' XI ^ 2 . 

\KeS 


1/2 


+ E ^ 2 ,J 

eeS / 


V2,R ■— h'x 11 / ~ {uh)\\i^^(^x) 

vIj ■■= hi + K |IKl||L,(e) 

This result is included as, from a practical point of view, the estimators seem to be reliable and efficient. 

5. Numerical experiments 

In this section we present some numerical results testing both the asymptotic behaviour of the estimators 
derived in previous sections as well as its effectiveness as an indicator for adaptivity. 

5.1. Inconsistency terms from We begin this section with an adaptive numerical test problem for 
the estimator given in Lemma 3.21 We take the diffusion coefficient to be 

'1 0 

0 arctan ^1000^ P kY p{y ~ )) + 2 

and U = [0,1]^ — [i, 1] x [0, i] to be the standard L-shaped domain. We use a maximum strategy adaptive 
algorithm given in Algorithm 1 |SS05| and the estimator given in Lemma 3.21 We use k = 2 and take the 
quadrature degree to be order 2. Figure shows the numerical solution at the final level of refinement and 
Figure shows the mesh generated and estimator contributions at various levels of refinement. Note the 
solution will be singular at the nonconvex corner of the domain. In addition the problem data is chosen 
such that A has a very large gradient over a large portion of the domain which induces a further singularity. 
Both seem to be captured correctly using this algorithm. 


(5.1) 


A = 
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Algorithm 1 Maximum strategy type adaptive algorithm incorporating inconsistencies. 

Given constants Ci, (72, C 3 such that Ci + C 2 + = 1, tol and k; 

Set N = 0; 

while CiUfi + ( 72 H/ + > tol AND N < Max Iterations do 

for aW K & do 

Compute r]i^,rij and rji; 

+ ri\] 

Hz = H, + ry2; 

for all e G K do 
H,7 = Hz + 
end for 

Set r]K = r]ii+ \'qj + r?/; 
if Vk > Cmaxigjr (77^) then 

Mark element K for refinement; 

end if 

if 77if < Kmaxig^^ ( 77 ^) then 

Mark element K for coarsening; 

end if 
end for 

Rr = Hz = VH7, Hz = 

Perform any mesh change; 

N = N+1; 

end while 


Figure 1. The adaptive approximation to the solution of the diffusion problem with 
problem data given in §5.1[ 
u_h 
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Figure 2. Testing the estimator given in Lemma [3.21| as a driver for adaptivity. The prob¬ 
lem data is described in 15.1 We consider various iterates of the adaptive procedure looking 


at the mesh and the estimator components. The top estimator is the inconsistency terms 
and the bottom the interior and jump residual. Notice the inconsistency term is extremely 
localised to where the diffusion coefficient has large gradient and becomes comparatively 
negligible once the mesh is sufficiently resolved in that region. 



(a) Iterate 3 



(b) Iterate 4 




(c) Iterate 5 


(d) Iterate 6 



(e) Iterate 7 


(f) Iterate 8 
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5.2. Inconsistent nonvariational approximations from We are studying both the primal and dual 
estimators derived in ^for nonconforming, inconsistent schemes approximating nonvariational problems. We 
examine the asymptotic behaviour of the estimators from Theorems |4.8| and |4.13| and also their effectiveness 
as drivers for adaptive algorithms. 


Test 1 : Asymptotic behaviour. With d = 2 if A'.TP'u is a unifor mly elliptic operator then the Campanato 
condition for strong solutions is satished |Tar04) and Assumption |2.l| is satisfied. We fix = [i, use the 
problem data 


(5.2) 



0 

sin (27rx) sin {^ny) + 2. 


We have chosen A in this fashion such that it varies slowly over the domain and the inconsistency term 
arising from quadrature approximation is negligible. We choose / such that the exact solution is given by 
either 


(5.3) 


u{x) = sin (27ra:) sin (27ry) € C^(Sl) 


or 

(5.4) + gH>(n)/H»(SI). 

In Figure we summarise the results of this test and demonstrate that computationally the asymptotic 
convergence rate of the estimator is the same as that of the error as predicted by the theory given in 
Theorems 14.81 and 14.91 


Figure 3. Testing the asymptotic behaviour of the estimator given in Theorem 4.8 
problem data is described in §5.2|- Test 1. 


The 




(a) When u S C°° (O) is given by ( |5.3| l 


(b) When u S H^(0) / is given by |5.4i 


Test 2 : Adaptive algorithms. We fix = [\, as in Test 1, and choose A as in (5.2). We choose / such 
that the exact solution is given in (|5.4[). Note this function is H^(n) / H^(n). We use the adaptive algorithm 


given in Algorithm 1 and, in Figure 1^ examine the convergence of said algorithm. The rates are increased 
from those given in Figure]^ (approximately 0(A^“^/^)) to 0(A^“^/^) when measured in the H^(n) norm. 
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Test 3 : Quadrature effects, smooth solution. We fix O = as in Test 1 and 2, and choose 


(5.5) 
where 

(5.6) 


A = 


1 0 

0 aii{x),_ 


aii{x) = 


10 sin (IOOttx) sin (IOOttj/) + 11 if a;, y > 0 

11 otherwise. 


Notice the diffusion coefficient oscillates much more heavily in the upper right quadrant of than that of 
the previous tests. The is to force the inconsistency term arising from quadrature to have a larger impact. 


We choose / such that the exact solution is smooth and given in (5.31. We use the adaptive algorithm given 


in Algorithm 1 and, in Figure]^ examine the behaviour of the inconsistency term compared to the rest of 
the estimator. The increase in size and oscillations of the diffusion term force the inconsistency term in the 
estimator to dominate at the initial coarse mesh scale which results in a denser mesh in the upper right 
quadrant of 11. 

Test 4 : Quadrature effects, nonsmooth solution. This test is the same as Test 4, with the exception that 
the solution is given by (|5.4[). Figure [^summaries the results. 
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Testing the estimator given in Theorem |4.8| as a driver for adaptivity. The 
Test 2 such that the solution is given by (5.4). We 


problem data is described in [5.2 


consider various iterates of the adaptive procedure governed by Algorithm 1 looking at the 
mesh generated. Notice the mesh is refined around where the solution is nonsmooth and the 
convergence rate of the adaptive algorithm is quicker than the uniform counterpart from 
Test 1. 



(a) Iterate 1. 


(b) Iterate 11. 



(c) Iterate 13. 


(d) Iterate 18. 



(e) Convergence rates for the adaptive algorithm. 
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Figure 5. Testing the estimator given in Theorem |4.8| as a driver for adaptivity. The 
problem data is described in §5.2| - Test 3. In this case the solution is smooth and the diffusion 
coefficient oscillates heavily in the upper right quadrant. We consider various iterates of the 
adaptive procedure looking at the mesh and the estimator components. The top estimator 
is the inconsistency terms and the bottom the interior and jump residual. Notice after the 
mesh is sufficiently resolved where the oscillations are heavy, the inconsistency term becomes 
comparatively negligible. 



(a) Iterate 5 


(b) Iterate 6 



(c) Iterate 7 


(d) Iterate 8 



(e) Iterate 9 


(f) Iterate 10 
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Figure 6. Testing the estimator given in Theorem |4.8| as a driver for adaptivity. The 
problem data is described in §5.2| - Test 4. In this case the solution is not smooth and 
the diffusion coefficient oscillates heavily in the upper right quadrant. We consider various 
iterates of the adaptive procedure looking at the mesh and the estimator components. The 
top estimator is the inconsistency terms and the bottom the interior and jump residual. 
Notice the mesh is very resolved around where the solution is not smooth and also where 
the oscillations in the problem data are heavy. 



(a) Iterate 5 



(b) Iterate 6 



(e) Iterate 9 


(f) Iterate 10 
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